Combining a popularity-productivity stochastic block model with a discriminative content model for detecting general structures

نویسندگان

  • Bian-fang Chai
  • Jian Yu
  • Cai-yan Jia
  • Tian-bao Yang
  • Ya-wen Jiang
چکیده

Latent community discovery that combines links and contents of a text-associated network, has drawn more attention with the advance of social medias. Most of the previous studies aim at detecting densely connected communities, and are not able to identify general structures, e.g., bipartite structure. Several variants based on stochastic block model are more flexible for exploring general structures by introducing link probabilities between communities. However, neither can these variants identify degree distributions of real networks due to lacking of modeling the differences among nodes, nor can they be suitable for discovering communities in text-associated networks due to ignoring the contents of nodes. In this paper, we propose a popularity-productivity stochastic block (PPSB) model by introducing two random variables, popularity and productivity, to model the differences among nodes in receiving links and producing links, respectively. The new model has the flexibility of existing stochastic block models in discovering general community structures, and inherits the richness of previous models that also exploit popularity and productivity in modeling the real scale-free networks with power law degree distributions. To incorporate contents in text-associated networks, we propose a PPSB-DC model which combines the PPSB model with a discriminative model that models the community memberships of nodes by their contents. We then develop EM algorithms for inferring the parameters in the two models. Experiments on synthetic and real networks have demonstrated that the proposed models can yield better performances than previous models, especially on networks with general structures.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Bayesian Framework for Community Detection Integrating Content and Link

This paper addresses the problem of community detection in networked data that combines link and content analysis. Most existing work combines link and content information by a generative model. There are two major shortcomings with the existing approaches. First, they assume that the probability of creating a link between two nodes is determined only by the community memberships of the nodes; ...

متن کامل

Directed Network Community Detection: A Popularity and Productivity Link Model

In this paper, we consider the problem of community detection in directed networks by using probabilistic models. Most existing probabilistic models for community detection are either symmetric in which incoming links and outgoing links are treated equally or conditional in which only one type (i.e., either incoming or outgoing) of links is modeled. We present a probabilistic model for directed...

متن کامل

A reliability-based maintenance technicians’ workloads optimisation model with stochastic consideration

The growing interest in technicians’ workloads research is probably associated with the recent surge in competition. This was prompted by unprecedented technological development that triggers changes in customer tastes and preferences for industrial goods. In a quest for business improvement, this worldwide intense competition in industries has stimulated theories and practical frameworks that ...

متن کامل

A stochastic model for the cell formation problem considering machine reliability

This paper presents a new mathematical model to solve cell formation problem in cellular manufacturing systems, where inter-arrival time, processing time, and machine breakdown time are probabilistic. The objective function maximizes the number of operations of each part with more arrival rate within one cell. Because a queue behind each machine; queuing theory is used to formulate the model. T...

متن کامل

The Impact of Publishing Islamic Treasury Bills on Fiscal Sustainability of the Iranian Government by Using a Dynamic Stochastic General Equilibrium Model

From the perspective of government accounting, the Publishing of Islamic Treasury Bills, due to the nature of these bonds that transfer of debt is permissible, there will be no additional financial burden for the government in the form of principal and interests of them. In other securities, on the other hand, the government is bound to pay the principal and its interests on the date of maturit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013